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ABSTRACT 



Disclosed are a family of synthetic proteins having 

, affinity for a preselected antigen. The proteins are char- 
acterized by one or more sequences of amino acids 
constituting a region which behaves as a biosynthetic 
antibody binding site (BABS). The sites comprise 1) 
non-covaleritly associated or disulfide bonded synthetic 
Vif and Vl dimers, 2) Vh-Vl or Vi-Vm single chains 
wherein the V# and Vl are attached by a polypeptide 
linker, or 3) individuals V# or V/, domains. The bindbg 
domains comprise linked CDR and FR regions, which 
may be derived from separate immunoglobulins. The 
proteins may also include other polypeptide sequences 
which function eg., as an enzyme, toxin, binding site, or 
site of attachment to an immobilization media or radio- 

■ active atom. Methods are disclosed for producing the 
proteins, for designing BABS having any specificity 

. that can be elicited by in vivo generation of antibody, 

[ and for producing analogs thereof. 
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agents. They arc routinely produced according to estab* 
BIOSYNTHETIC ANTIBODY BINDING SITES listed procedures by hybridomas generated by fusion of 

mouse lymphoid cells with an appropriate mouse my- 
The United States Government has certain rights in eloma cell line. Human monoclonal antibodies arc diffi- 
thts application as the subject matter hereof was devel- 5 cult to produce by cell fusion techniques since, among 
oped in part using funds from SBIR Grant Nos. SSS-4 other problems, human hybridomas are notably unsta- 
1 R43 CA39870-01 and SSS-4 2 R44 CA39870-02. ble, and removal of immunized spleen cells from hu- 

RPFFRFMrF TO RFI ATFH APP1 IPATTniSTC maDS " ROt fettible 35 iX * f0f r0denlS - Monoclonals 

REFERENCE TO RELATED APPLICATIONS wWch havc speciflcilies c f significant therapeutic value 
This application is a divisional of copending applies- JO are generally of murine or rat origin, and are therefore 
tion Sen No. 213,761 filed June 30, 1988, which is a immunogenic to the human immune system, 
continuation of copending application Ser. No. 052,800 Chimeric antibodies composed of human and non- 
filed May 21, 1987. Related applications include: Ser. human amino acid sequences potentially have improved 
No. 636,770 filed Jan. 2, 1991, which is a second divi- therapeutic value as they presumably would elicit less 
sionaJ of 213,761; Ser. No. 213,671 filed June 30 t 1988, 15 circulating human antibody against the non*human im- 
which also is a continuation of 052,800, and Ser. No. munoglobulin sequences. Accordingly, hybrid antibody 
342,449, filed Feb. 6 r 1989, which is a continuation-in- molecules have been proposed which consist of immu- 
part of 052,800. noglobulin light and heavy chain amino acid sequences 

HArvnttnrrwn np m ixjVFMXinisi from difierent mammalian sources. The chimeric ami- 

BACKGROUND OF THE INVENTION 20 designe d thus far comprise variable regions from 

This invention relates to novel, compositions of mat- one mammalian source, and constant regions from 
ter, hereinafter called biosynthetic antibody binding human or another mammalian source (Morrison et al. 
sites or BABS, useful, for example, in specific binding 1984, Proc. Natl. Acad. Sci. U.S.A., 81:5851-6855; Neu 
assays, affinity purification, biocatalysis, drug targeting, berger et al., 1984, Nature 312:604-608; Sahagan et al 
imaging, immunological treatment of various oncogenic 25 1986, J. Immunol. 137:1066-1074; EPO application nos 
and infectious diseases, and in other contexts. More 84302368,0, Genentech; 85102665.8, Research Develop 
particularly, this invention relates to biosynthetic poly- ment Corporation of Japan; 85305604.2, Stanford; P.C 
peptides having a structure similar to native antibody T, application no. PCT/GB85/00392, Celltech Lim- 
binding sites, DNAs encoding the polypeptides pre- ited). 

pared by recombinant DNA techniques, vectors com- 30 It has been reported that constant regions are not 
prising these DNAs, and methods for the design and required for antigen recognition or binding; these prop- 
production of these polypeptides. erties have been localized to the variable domains of the 

Antibodies are proteins belonging to a group of im- antibody molecule located at the amino terminal end of 
munoglobulins elicited by the immune system in re- both the heavy and light chains. The variable regions 
sponse to a specific antigen or substance which the body 35 remain noncovalently associated (as Vt? dimers, 
deems foreign. Antibodies can both recognize and bind terrned Fv regions) even after proteolytic cleavage 
that antigen, and are involved in a number of effector from the native antibody molecule, and retain much of 
reactions such as complement fixation and allergic re- their antigen recognition and binding capabilities (Inbar 
sponses. et at, Proc Natl. Acad. Sci. U.S.A., 1972, 69:2659-2662; 

There are five classes of human antibodies which 40 Hochman et. al., 1973, Biochem. 12 1130-1135 and 
have the ability to selectively recognize and preferen- 1976, Biochem. 15:2706-2710; Sharon and Givol, 1976, 
tially bind a specific antigen. Each antibody class has Biochem. 15:1591-1-594; Rosenblatt and Habcr, 1978, 
the same basic structure (see FIG. 1), or multiples Biochem. 17:3877-3882; Ehrlich et al,, 1980, Biochem. 
thereof, consisting of two identical polypeptides called 19:4091-40996). 

heavy or H chains (molecular weight in IgG approxi* 45 cttviuadv nu tot? Tmrcxmnw 

mately 50,000 d each) and two identical polypeptides SUMMARY OF THE INVENTION 

called light or L chains (molecular weight approxi- A class of novel biosynthetic polypeptides has now 
mately 25,000 d each). Each of the five antibody classes been designed and engineered which comprise biosyn- 
has a similar set of light chains and a distinct set of thetic antibody binding sites, that is, "BABS" or chime- 
heavy chains. A light chain is composed of one variable 50 ric polypeptides defining stucture capable of selective 
and one constant domain, while a heavy chain is com- antigen recognition and preferential antigen binding, 
posed of one variable and three or more constant do- In its broadest aspects, this invention features poly- 
mains. The variable domains determine the specificity peptides comprising biosynthetic antibody binding sites, 
of the immunoglobulin, the constant regions have other DNA encoding these polypeptides prepared by recom- 
functions. 55 bin ant DNA techniques, vectors comprising these 

Amino acid sequence data indicate that each variable DNAs, and methods for the production of these poly- 
domain comprises three hypervariable regions flanked peptides. , 
by four relatively conserved framework regions (Kabat In one aspect, the invention is based on the observa- 
et. al., Sequences of Proteins of Immunological Interest tion that three subregions of the variable domain of each 
[U.S. Department of Health and Human Services, third 60 of the heavy and Sight chains of native immunoglobulin 
edition 1983, fourth edition, 1987]). The hypervariable molecules collectively are responsible for antigen rec- 
regions have been assumed to be responsible for the ognition and binding. Each of these subregions, called 
binding specificity of individual antibodies and to ac- herein "complementarity determining regions" or 
count for the diversity of binding of antibodies as a CDRs, consists of one of the hypervariable regions or 
protein class. 65 loops and of selected amino acids or amino acid sequcn- 

Monoclonal antibodies, or homogeneous antibodies ces disposed in the framework regions which flank that 
of identical genetic parentage and binding specificity, particular hypervariable region. It has now been dis- 
havc been useful both as diagnostic and therapeutic covered that framework regions from diverse species 
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are effective to maintain CDRs from di verse other spe- itself is bioactive to produce a bifunctional or multifunc- 
cies in proper conformation so as to achieve true immu- tionaJ protein. For example, the synthetic Fv may in- 
nochemical binding properties in a biosynthetic protein. elude a leader or trailer sequence defining a polypeptide 
Thus, BABS produced in accordance with the inven- having enzymatic activity, independent affinity for an 
tion comprise biosynthetically produced novel sequen- 5 antigen different from the antigen to which the chimeric 
ces of amino acids defining polypeptides designed to Fv is directed* or having other functions such as to 
bind with a preselected antigenic material. The struc- provide a convenient site of attachment for a radioac- 
ture of these synthetic polypeptides is unlike that of tive atom, or simply to enhance expression in procary- 
naturally occurring antibodies, fragments thereof, or otic host cells or yeasts. 

known synthetic polypeptides or "chimeric antibodies" 10 Such tandem arrangement of V// and Vj. polypep- 
in that the regions of the BABS responsible for specific- tides can increase the stability of the antigen binding site 
ity and affinity of binding, (analogous to native antibody and facilitate its coupling to proteins utilized in drug 
variable regions) are themselves chimeric, e.g., com- targeting and moieties useful in imaging. The therapeu- 
prise amino acid sequences homologous to portions of tic use of such chimeric Fvs provide a number of advan- 
at least two different antibody molecules. 15 tages over larger fragments or complete antibody mole- 

The invention thus provides a chimeric polypeptide culest they are often quite stable and less immunogenic; 
defining a region capable of selective antigen binding they can penetrate body tissues more rapidly for pur- 
and recognition. This chimeric polypeptide comprises poses of imaging or drug delivery because of their 
amino acid sequences homologous to portions of the smaller size; and they can facilitate accelerated clear- 
CDRs of the variable domain of one immunoglobulin 20 ance of targeted isotopes or drugs, 
light or heavy chain, and other sequences homologous Other embodiments of the invention comprise multi- 
to the framework regions, or FRs, of the variable do- functional polypeptides consisting of one or more single 
main of a second, different immunoglobulin light or chain Fvs either linked V# and V/,dimers, individual 
heavy chain. Polypeptides so constructed bind a spe- V/. or V#, Or any of the foregoing comprising CDRs 
cific preselected antigen determined by the CDRs. Pref- 25 and FRs from different or the same immunoglobulins, 
erably, the chimeric polypeptides comprise an amino linked to a second functional protein domain such as, 
acid sequence homologous to at least a portion of the for example, a toxin, enzyme, or site of attachment to an 
variable regions of a mammalian immunoglobulin, such immobilization matrix. Yet another embodiment is a 
as those of mouse, rat, or human origin. In one preferred polypeptide comprising several identical or non-identi* 
embodiment, the biosynthetic antibody binding site 30 cal BABS which recognize a group of antigenic deter- 
comprises FRs homologous with a portion of the FRs minants that are periodic or closely spaced in their 
of a human immunoglobulin and CDRs homologous normal environment, e.g., on a cell surface. This ar- 
with CDRs from a mouse immunoglobulin. This type of rangement confers greatly augmented affinity and/or 
chimeric polypeptide displays the antigen binding sped- specifically on the BABS-containing protein analogous 
ficity of the mouse immunoglobulin, while its human 35 to, for example, the way IgM (containing 10 Fabs) binds 
framework minimizes human immune reactions. In ad* to the surfaces of certain cells, 
dition, the chimeric polypeptide may comprise other In other aspects, the invention provides DNA se- 
amino acid sequences. It may comprise, for example, a quences encoding chimeric polypeptides of the type 
sequence homologous to a portion of the constant do- described above, vectors including such sequences, and 
main of an immunoglobulin, but preferably is free of 40 methods employing the DNAs and vectors for produc- 
constant regions (other than FRs). ing the polypeptides. 

The invention also provides a single chain composite A novel method of producing BABS involves the 
polypeptide having antigen binding abilities, and com- construction of a DNA containing three polynuceiotide 
prising a pair of amino acid sequences homologous or sequences (Xi, X2 and Xa). Each of the sequences con- 
analogous respectively to the variable regions of an 45 tain restriction sites proximal its 3' and 5' ends t and each 
immunoglobulin light and heavy chain, (linked V#-Vj. is flanked by polynucleotide sequences (FRj, FR 2 , FR3 
or single chain Fv). Both Vjyand V/,may copy natural and FR4) encoding selected framework region (FR) 
monoclonal sequences, or one or both of the chains may amino acid sequences homologous to a portion of the 
comprise a CDR-FR construct of the type described variable domain of an immunoglobulin. This DNA has 
above. The separate polypeptides analogous to the van* 50 the structure: 
able regions of the light and heavy chains are held to- 
gether by a polypeptide linker Ri-FRi-Xi-FR2-X>-FR3-X3-FiU'R2 

This type of chimeric polypeptide is thus a single 
chain composite polypeptide comprising a complete where K\ is a 5' phosphate group or polynuceiotide 
antibody binding site. This single chain composite poly- 55 sequence and R2 is a 3' hydroxyl group or polynudco* 
peptide has a structure patterned after tandem V// and tide sequence. The X polynucleotide sequences may be 
Vl domains, but with the carboxyl terminal of one selectively excised using restriction enzymes 7 and re- 
attached through an amino add sequence to the ammo placed by other DNA sequences encoding the CQR 
terminal of the other. It thus comprises an amino acid amino acid sequences of a variable domain of a selected 
sequence which is homologous to a portion of the van- 60 immunoglobulin. This type of DNA sequence may en- 
able region of an immunoglobulin heavy chain (Vu) code at least part of the variable region of either or both 
peptide bonded to a second amino acid sequence which a heavy or light chain of an immunoglobulin and may, 
is homologous to a portion of the variable region of an in addition, comprise a third phosphodiestcr-linked 
immunoglobulin light chain (Vi)< The linking amino nucleotide or polynucleotide sequence of a nature and 
acid sequence may or may not itself be antigenic or 65 function described above. 

biologically active. In addition, either the amino or In yet another aspect, the invention provides a 
carboxyl terminal ends of these chimeric, single chain method for producing intact biosynthetic antibody 
Fvs may be attached to an amino acid sequence which binding sites or native Fv free of all or substantially all 
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constant region amino acids. The method involves en- chain of the same BABS; (4£) a heavy chain variable 

zymatic digestion of chimeric immunoglobulin or at region of a BADS comprising CDRs of glp-4 and FRs 

least Fab regions which have been engineered to con- of NEWM; and (4F) a light chain variable region com- 

tain preferential proteolytic cleavage sites located be- prising CDRs of glp-4 and FRs of NEWM. Delineated 

tween the variable and constant regions of the immuno- 5 are FRs, CDRs, and restriction sites for endonuclease 

globulin heavy and light chains. Digestion of the intact digestion, most of which were introduced during design 

immunoglobulin with the appropriate protease yields a of the DNA. 

complete antigen binding site or Fv fragment. This FIG. 5 is the nucleic acid and encoded amino acid 

approach works well in myeloma or hybridoma expres- sequence of a host DNA (V//) designed to facilitate 
tion systems. 10 insertion of CDRs of choice. The DNA was designed to 

Accordingly, it is an object of this invention to pro- have unique 6-base sites directly flanking the CDRs so 

. vide novel proteins comprising biosynthetic antibody that relatively small oligonucleotides defining portions 

binding sites including an amino acid sequence homolo* of CDRs can be readily inserted, and to have other sites 

gous to specific portions of the variable region of immu- to facilitate manipulation of the DNA to optimize bind- 

noglobulin light chain and/or heavy chain, to provide IS ing properties in a given construct. The framework 

DNA sequences which encode the biosynthetic anti- regions of the molecule correspond to mouse FRs (c,f, 

body binding sites, and to provide replicable expression FIG. 4A), 

vectors capable of expressing DNA sequences encoding FIG. 6 is a protein constructed in accordance with ' 

the biosynthetic antibody binding sites. Another object the invention comprising FB-Asp-Pro-V/r(Gly4-Ser)3- 

is to provide a generalized method for producing bio- 20 FB is the FB fragment of protein A, here used as a 

synthetic antibody binding site polypeptides of any leader, and constituting a binding site for Fc, Asp-Pro is 

desired specificity. a dilute acid cleavage site, and the remainder of the 

RRIFF r>F<;riUPTlON OF THF DRAWING comprises a "Ugle chain BABS comprising 

BRIEF DESCRIPTION OF THE DRAWING ^ Vfg md y.^ chains of mou&c monoc | onal 2 6-10 

The foregoing and other objects of this invention, the 25 linked together with a 15 amino acid sequence. V'z, is 

various features thereof, as well as the invention itself, the Vj. of mouse monoclonal 26-10 altered at residue 4 

may be more fully understood from the following de- where valine replaces methionine. This construct binds 

scription, when read together with the accompanying both Fc and digoxin. 

drawings. FIG. 7 is a graph of percent of undiluted units bound 

FIG. 1A is a schematic representation of an intact 30 versus concentration comparing the binding of native 

IgG antibody molecule containing two light chains, 26-10 and the construct of FIG, 6 and FIG. 2 A rena- 

each consisting of one variable and one constant do* tured using two different procedures. Plot 3 represents 

main, and two heavy chains, each consisting of one the data for the native 26-10 antibody; plot 1 represents 

variable and three constant domains. data from the construct of FIGS. 6 and 2A renatured 

FIG. IB is a schematic drawing of the structure of Fv 35 using the slow folding procedures described herein; and 

illustrating V// and Vj. domains, each of which com- plot 2 represents data from the same construct rena* 

prises four framework (FR) regions and three comple* tured using the fast dilution/quick refolding procedure 

mentarily determining regions (CDR). Boundaries of disclosed herein* 

CDRs are indicated, by way of example, for mono- In FIGS, 4A-4E and 6, the amino acid sequence of 

clonal 26-10, a well known and characterized murine 40 the expression products start after the GAATTC se- 

monoclonal specific for digoxin. ,quences, which codes for an EcoRI splice site, trans- 

FIG. 2A-2D are schematic representations of some lated as Glu-Phe on the drawings, 

of the classes of reagents constructed in accordance r - 

with the invention, each of which comprises a biosyn- Description of the Invention 

thetic antibody binding site. 45 As is now well known, Fv, the minimum antibody 

FIG. 3 discloses five amino acid sequences (heavy fragment which contains a complete antigen recogni- 

chains) in single letter code lined up vertically to facili- tion and binding site, consists of a dimer of one heavy 

tate understanding of the invention. Sequence 1 is the and one light chain variable domain in tight, noncova* 

known native sequence of V# from murine monoclonal lent association (FIG. 1). It is in this configuration that 

glp-4 (antMysozyme). Sequence 2 is the known native 50 the three complementarity detennining regions of each 

sequence of V// from murine monoclonal 26-10 (anti- variable domain interact to define an antigen binding 

digoxin). Sequence 3 is a BABS comprising the FRs site on the surface of the Vj^Vt dimer. Collectively, the 

from 26-10 V# and the CDRs from glp-4 V#. The six complementarity determining regions (see FIG. IB) 

CDRs are identified in lower case letters; restriction confer antigen binding specificity to the antibody. FRs 

sites in the DNA used to produce chimeric sequence 3 55 flanking the CDRs have a tertiary structure which is 

are also identified. Sequence 4 is the known native se- essentially conserved in native immunoglobulins of 

quence of V/r from human myeloma antibody NEWM. species as diverse as human and mouse. These FRs 

Sequence 5 is a BABS comprising the FRs from serve to hold the CDRs in their appropriate orientation. 

NEWM V# and the CDRs from glp-4 Vh\ it., illus- The constant domains are not required for binding funo 

tratcs a binding site having a human framework but an 60 tion, but may aid in stabilizing Vr-Vl interaction. Even' 

affinity for lysozyme similar to glp-4. a angle variable domain (or half of an Fv comprising 

FIGS. 4A-4F are the synthetic nucleic acid sequen- only three CDRs specific for an antigen) has. the ability 
ces and encoded amino acid sequences of (4A) the to recognize arid bind antigen, although at a lower affin- 
heavy chain variable domain of mouse anti-digoxin ity than an entire binding site (Painter et ah, 1972, Bio- 
monoclonal 26-10; (4B) the light chain variable domain 65 chem. 11: 1327-1337). 

of mouse anti-digoxin monoclonal 26-10; (AC) a heavy This knowledge of the structure of immunoglobulin 

chain variable domain of a chimeric Fv (BABS) com- proteins has now been exploited to develop biosynthetic 

prising CDRs of glp-4 and FRs of 26-10; (4D) a light antibody binding sites provided by this invention. 



5,091,513 

7 8 

The biosynthetic antibody binding sites embodying Other sections, e.g., C# and Q,, or native immuno- 

the invention are biosynthetic in the sense that they are globulin protein structure need not be present and nor- 

synthesized in a cellular host made to express a syn- tnally are intentionally omitted from the biosynthetic 

thetic DNA, that is, a recombinant DNA made from proteins of this invention. However the BA6S of the 

ligation of plural, chemically synthesized oligonucleo- 5 invention may comprise additional polypeptide regions 

tides, or by ligation of fragments of DNA derived from defining a Woactive region, e.g., a toxin or enzyme, or a 

the genome of a hybridoma, mature B cell clone, or a ate onto which a toxin or a remotely detectable sub- 

cDNA library derived from such natural sources, The stance can be attached. 

proteins of the invention are properly characterized as The clinical administration of the BABS of the inven- 

"antibody binding sites" in that these synthetic mote- »> tion, which display the activity of native, relatively 

cules are designed specifically to have at least some sm!ul Fv > V A or Vj, fragments, affords a number of 

affinity for a preselected antigenic substance. The poly- advantages over the use of larger fragments or entire 

peptides of the invention are antibody-like in that their antibody molecules. The BABS of this invention offer 

structure is patterned after regions of native antibodies cleavage sites to circulating proteolytic enzymes 

known to be responsible for antigen recognition. 15 « nd **» ° a ' er "*d*y- They reach their target 

More specifically, the structure of these biosynthetic «■"■ morerapidly, and are cleared more quickly from 

proteins in the region which impart the binding proper- the body. They also have reduced mimunogemcity. In 

ties to the protein, is analogous to the Fv region of a ad * tlon ' sma,ler ■« »«htates coupling to other 

natural antibody, It comprises a series of regions con- m mo jf u . ,es m dnig targeting and imaging application, 

sisting of amino acids defining at least three polypeptide 20 ™VT Bt, °?. *"» ^es "tact biosynthetic ant., 

segments which together form the tertiary molecular ***** bmdm 8 s,tes md °?™. ^"'X^T^ v^a 

structure responsible for affinity and binding. These nonpatent y associated, disulfide bonded or Jinked 

regions are herein called complementarity determining * f$Wf quence to form a composite V^V £ 

regions or CDRs. These CDR regions are held in ap- „ or ^SrSL . ? f* 
propriate conformation by polypeptide segments analo- 25 

*\T #kjl r Mm -„ frt -i, JLL n j!*Je p^<vo™*** rtf provides proteins analogous to an independent V// or 

Sturaf a^bSs ^ Vidomain - ** y ^ *ese may bTprovided in a 

■wMi 'wwwu««- form linked to amino acid sequences exclusive of those 

The term CDR as used herein, refers to amino acid rf ^ dQm ^ to ^ acids 

sequences which together define the binding affinity 3Q ^ or homologous * pro tehis of a constant do- 

and specificity of the natural Fv region of a native tin- ^ or bioactivc molecules such w a hormone 

munoglobuhn binding site, or a synthetic polypeptide or toxin A proteolytic cJcavage silc ^ also be de- 

which mimics this function. CDRs typically ^are not Bi ^ Mo the f ^ on siting the variable region . 

wholly homologous to hypervanable regions of natural m from othcr veaiattt sequences so as to 

Fvs t but rather also include specific amino acids or 35 fecilitate cleavage of intact V^and/or V/r, free of other 

amino acid sequences which flank the hypervanable protein, 

region and have heretofore been considered framework FIG s 2 A, 2B, 2C, and 2D illustrate four examples of 
not directly determinitive of complementarity. The pro ttin structures embodying the invention that can be 
term FR, as used herein, refers to amino acid sequences produced by following the teaching disclosed herein, 
interposed between CDRs. 40 All are characterized by one or two biosynthetic poly- 

The CDR and FR polypeptide segments are designed pep (idc segments defining a binding site 3, and compris- 
empirically based on sequence analysis of the Fv region ■ fog amino acid sequences comprising CDRs and FRs, 
of preexisting antibodies or of the DNA encoding them. often deriv ed from different immunoglobulins, or se- 
In one embodiment, the amino acid sequences constitut- quences homologous to a portion of CDRs and FRs 
ing the FR regions of the BABS are analogous to the 45 from different immunoglobulins. FIG, 2A depicts a 
FR sequences of a first preexisting antibody, for exam- s in g i c chain Fv comprising a polypeptide 10 having an 
ple t a human IgG. The amino acid sequences constitut* amino acid sequence analogous to the variable region of 
ing the CDR regions axe analogous to the sequences ^ immunoglobulin heavy chain, bound through its 
from a second, different preexisting antibody, for exam- carboxyl end to a polypeptide linker 12, which in turn is 
pie, the CDRs of a murine IgG. Alternatively, the 50 bound to a polypeptide 14 having an amino acid $e- 
CDRs and FRs from a single preexisting antibody from, quence analogous to the variable region of an immuno- 
e.g. f an unstable or hard to culture hybridoma, may be globulin light chain. Of course, the light and heavy 
copied in their entirety* chain domains may be in reverse order. The linker 12 

Practice of the invention enables the design and bio- should be at least long enough (e.g., about 15 amino 
synthesis of various reagents, all of which are character- 55 acids or about 40A) to permit the chains 10 and 14 to 
ized by a region having affinity for a preselected ami- assume their proper conformation. The linker 12 may 
genie substance. Other regions of the biosynthetic pro- comprise an amino acid sequence homologous to a se- 
tein are designed with the particular planned utility of quence identified as "self by the species into which it 
the protein in mind. Thus, if the reagent is designed for will be introduced, if drug use is intended. Unstruc* 
intravascular use in mammals, the FR regions comprise 60 tured, hydrophilic amino acid sequences are preferred, 
amino acids similar or identical to at least a portion of It may also comprise a bioactivc polypeptide such as a 
the framework region amino acids of antibodies native cell toxin which is to be targeted by the binding site, or 
to that mammalian species. On the other hand, the a segment easily labeled by a radioactive reagent which 
amino acids comprising the CDRs may be analogous to is to be delivered, e.g. t to the site of a tumor comprising 
a portion of the amino acids from the hypervanable 65 an epitope recognized by the binding site. Other prote- 
region (and certain flanking amino acids) of an antibody ins or polypeptides may be attached to either the amino 
having a known affinity and specificity, e.g., a murine or or carboxyl terminus of protein of the type illustrated in 
rat monoclonal antibody. FIG. 2A. As an example, a helically coiled polypeptide 
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structure illustrating a leader comprising a protein A proper conformation. The various proteins are then 
fragment is shown extending from the amino terminal tested for binding ability, and one having appropriate 
end of V# domain 10. affinity is selected for incorporation into a reagent of 

FIG. 2B illustrates two separate chains non-cova- the type described above. If necessary, point substitu- 
lently associated and defining a binding site 3. It com- 5 tions seeking to optimize binding may be made in the 
prises separate peptides 16 and 18 comprising a chimeric DNA using conventional casette mutagenesis or other 
V^yand Vi of the type described above. Tne carboxyl protein engineering methodology, 
terminus of each protein chain may be designed to in- Of course, the processes for manipulating, amplify- 
clude one or more cysteine residues so that oxidation of ing, and recombining DNA which encode amino acid 
properly folded structures produces disulfide bonds (see 10 sequences of interest are generally well known in the 
FIG. 2C) further stabilizing the BABS. Either or both art, and therefore, not described in detail herein. Metb- 
of the polypeptides may further comprise a fused pro- ods of identifying and isolating genes encoding antibod- 
tein imparting other biological properties to the reagent ies of interest are well understood, and described in the 
in addition t the ability to bind to the antigen as speci- patent and other literature. In general, the methods 
tied by the interaction of the triplet CDRs on the re* 15 involve selecting genetic material coding for amino 
spective polypeptides 16 and 18. acids which define the CDRs and FRs of interest ac- 

FIG. 2D depicts another type of reagent, comprising cording to the genetic code, 
only one set of three CDRs, e.g., analogous to a heavy Accordingly, the construction of DNAs encoding 
chain variable region, which retains a measure of affin- BABS as disclosed herein can be done using known 
ity for the antigen. Attached to the carboxyl end of the 20 techniques involving the use of various restriction en- . 
polypeptide comprising the FR and CDR sequences zymes which make sequence specific cuts in DNA to 
constituting the binding site 3 is a Pendant Protein P produce, blunt ends or cohesive ends, DNA ligases, 
consisting of, for example, a toxin, therapeutic drug, techniques enabling enzymatic addition of sticky ends 
binding protein, enzyme or enzyme fragment, site of to blunt-ended DNA, construction of synthetic DNAs 
attachment for an imaging agent (e.g.» to chelate a radi- 25 by assembly of short or medium length oligonucleo- 
oactive ion such as Indium), or site of attachment to an tides, cDN A synthesis techniques, and synthetic probes 
immobilization matrix so that the BABS can be used in for isolating immunoglobulin genes. Various promoter 
affinity chromatography. sequences and other regulatory DNA sequences used in 

Of course, the protein may comprise more than one achieving expression, and various types of host cells are 
binding site or copies of a single binding site, and a 30 also known and available. Conventional transfection 
number of other functional regions. techniques, and equally conventional techniques for 

As is evidenced from the foregoing, the invention cloning and subcloning DNA are useful in the practice 
provides a large family of reagents comprising proteins, of this invention and known to those skilled in the art. 
at least a portion of which defines a binding site pat- Various types of vectors may be used such as plasmids 
terned after the variable region or regions of natural 35 and viruses including animal viruses and bacterio- 
immunoglobulins. It will be apparent that the nature of phages. The vectors may exploit various marker genes 
any protein fragments linked to the BABS, and used for which impart to a successfully transfected cell a detect- 
reagents embodying the invention, are essentially un- able phenotypic property that can be used' to identify 
limited, the essence of the invention being the provision, which of a family of clones has successfully incorpo- 
either alone or linked in various ways to other proteins, 40 rated the recombinant DNA of the vector, 
of binding sites having specificities to any antigen de~ One method for obtaining DNA encoding the BABS 
sired. disclosed herein is by assembly of synthetic oligonucle- 

The BABS of the invention are designed at the DNA otides produced in a conventional, automated, polynu- 
leve). The chimeric or synthetic DNAs are then ex* cleotide synthesizer followed by ligation with appropri* 
pressed in a suitable host system, and the expressed 45 ate ligases. For example, overlapping, complementary 
proteins are collected and renaturcd if necessary. " DNA fragments comprising 15 bases may be synthe- 
The ability to design the BABS of the invention de- sized semi manually using phosphoramidite chemistry, 
pends on the ability to determine the sequence of the with end segments left unphosphorylated to prevent 
amino acids in the variable region of monoclonal anti- polymerization during ligation. One end of the syn- 
bodies of interest, or the DNA encoding them. Hy- 50 thetic DNA is left with a "sticky end" corresponding to 
bridoma technology enables production of cell lines the site of action of a particular restriction endonucle- 
secreting antibody to essentially any desired substance ase, and the other end is left with an end corresponding 
that produces an immune response. RNA encoding the to the site of action of another restriction endonuclease. 
light and heavy chains of the immunoglobulin can then Alternatively, this approach can be fully automated, 
be obtained from the cytoplasm of the hybridoma, and 55 The DNA encoding the BABS may be created by syn- 
dic 5' end portion of the mRNA can be used to prepare thesiziog longer tingle strand fragments (eg., 50-100 . 
the cDNA for subsequent sequencing, or the amino acid nucleotides long) in, for example, a Biosearch oligonu* 
sequence of the hypervariable and flanking framework cleotide synthesizer, and then ligating the fragments, 
regions can be determined by amino acid sequencing of Still another method of producing the BABS of the 
the H and L chains and their V region fragments. Such .60 invention is to produce a synthetic DNA encoding a 
sequence analysis is now conducted routinely. This polypeptide comprising, e.g., human FRs, and taterven- 
knowledge permits one to design synthetic genes en* mg^dummy^CDR^ or ammo adds having no function 
coding FR and CDR sequences which likely will bind except to define suitably situated unique restriction 
the antigen. These synthetic genes are then prepared sites. This synthetic DNA is then altered by DNA re- 
using known techniques, or using the technique dis- 65 placement, in which restriction and ligation is employ 
closed below, and then inserted into a suitable host, ed to insert synthetic oligonucleotides encoding CDRs 
expressed, and purified. Depending on the host cell, defining a desired binding specificity in the proper loca- 
maturation techniques may be required to attain tion between the FRs. 
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This technique is dependent upon the ability to cleave and then the fused polypeptide sequence is expressed, 

a DNA corresponding in structure to a variable domain tested, and altered for empirical refinement, 

gene at specific sites flanking nucleotide sequences en* The hinge region is free of cysteine. This enables the 

coding CDRs. These restriction sites in some cases may cleavage reaction to be conducted under conditions in 

be found in the native gene. Alternatively, non-native 5 which the protein assumes its tertiary conformation, 

restriction sites may be engineered into the nucleotide and may be held in this conformation by intramolecular 

sequence resulting in a synthetic gene with a different disulfide bonds. It has been discovered that in these 

sequence of nucleotides than the native gene, but encod- conditions access of the protease to potential cleavage 

ing the same variable region amino acids because of the sites which may be present within the target protein is 

degeneracy of the genetic code. The fragments result- 10 hindered. The hinge region may comprise an amino 

ing from endonuclease digestion, and comprising FR- acid sequence which includes one or more proline resi- 

encoding sequences, are then ligated to non-native dues. This allows formation of a substantially unfolded 

CDR-encoding sequences to produce a synthetic van- molecular segment. Aspartic acid, glutamic acid, argi- 

able domain gene with altered antigen binding specifity. nine, lysine, serine, and threonine residues nmimize 

Additional nucleotide sequences encoding, for example, 15 ionic interactions and may be present in amounts and/or 

constant region amino acids or a bioactive molecule in sequence which renders the moiety comprising the 

may also be linked to the gene sequences to produce a hinge water soluble. 

Afunctional protein. In the case of single chain Fv comprising fused H and 

The expression of these synthetic DNA's can be L chains, the cleavage site preferably is immediately 

achieved in both prokaryotic and eucaryotic systems 20 adjacent the Fv polypeptide and comprises one or a 

via transfection with the appropriate vector. In E. coli sequence of amino acids exclusive of any one or se- 

and other microbial hosts, the synthetic genes can be quence found in the amino acid structure of the BABS. 

expressed as fusion protein. Expression in eucaryotes Where BABS V# and V/, regions are on separate chains 

can be accomplished by the transfection of DNA se- (i.e., see FIG. 1A), the cleavage sites may be either 

quences encoding CDR and FR region amino acids into 25 immediately adjacent their C-terminal ends, thereby 

a myeloma or other type of cell line. By this strategy releasing Fv dimer of V# and V/, upon appropriate 

intact hybrid antibody molecules having hybrid Fv cleavage (i.e., to yield the species of FIG, 2B), or may 

regions and various bioactive proteins including a bio- follow pendant polypeptides with or without cysteine 

synthetic binding domain may be produced. For fusion that yield, respectively, the species of FIG. 2C or 2D 

protein expressed in bacteria subsequent proteolytic 30 upon digestion. 

cleavage of the isolated V/j and Vj. fusions can be per- The cleavage site preferably is designed for cleavage 

formed to yield free V// and Vi t which can be rena- by a specific selected agent. Endopeptidases are pre- 

tured, and reassociated (or used separately) to obtain an ferred, although non-enzymatic (chemical) cleavage 

intact biosynthetic, hybrid antibody binding site. agents may be used. Many useful cleavage agents, for 

Heretofore, it has not been possible to cleave the 35 instance, cyanogen bromide, dilute acid, trypsin, Staph- 

heavy and light chain region to separate the variable ylococcus aureus V-8 protease, post proline cleaving 

and constant regions of an immunoglobulin so as to enzyme, blood coagulation Factor Xa, enterokinase, 

produce intact Fv, except in specific cases not of gen- and renin, recognize and preferentially or exclusively 

era! utility. However, one method of producing BABS cleave particular cleavage sites. One currently pre- 

in accordance with this invention is to redesign an im- 40 ferred cleavage agent is V-8 protease. The currently 

munoglobulin at the DNA level so as to alter its speci- preferred cleavage site is a Glu residue. Other useful 

ficity and so as to incorporate a cleavage site and "hinge enzymes recognize multiple residues as a cleavage site, 

region" between the variable and constant regions of e.g., factor Xa (Ile-Glu-Gly-Arg) or enterokinase (AsP- 

both the heavy and light chains. Such chimeric antibod* Asp-AsP-Asp-Lys). 

ies can be produced in transfectomas or the like and 45 pypupt mrATTAN 

subsequently cleaved using a preselected endopepti- cAcjyirtiriwwiuw 

dase. The engineering principles involved in these easily FRs from the heavy and light chain murine anti- 

cleaved constructs are disclosed in detail in copending digoxin monoclonal 26*10 (FIGS. 4A and 4B) were 

U.S. application Ser. No. 028,484 filed Mar. 20, 1987 by encoded on the same DNAs with CDRs from the mu- 

Huston et al. 50 rine anti-lysozyme monoclonal glp*4 heavy chain (FIG. 

The hinge region is a sequence of amino acids which 3 sequence 1) and light chain to produce heavy (FIG. 

serve t o promote efficient cleavage by a preselected 4C) and light (FIG. 4D) chain together defining a chi- 

cleavage agent at a preselected, built-in cleavage site. It meric antibody binding site which is specific for . lysc- 

is designed to promote cleavage preferentially at the zyme. Murine CDRs from both the h$avy and light 

cleavage site when the polypeptide is treated with the 55 chains of monoclonal glp-4 were encoded on the same 

cleavage agent in an appropriate environment. DNAs with FRs from the heavy and light c / hains of 

The hinge can take many different forms. Its design human myeloma antibody NEWM (FIGS. 4E and 4F). 

involves selection of amino acid residues (and a DNA The resulting interspecies chimeric antibody binding 

fragment encoding them) which impart to the region of domain has reduced immimogenicity in humans because 

the fused protein about the cleavage site an appropriate 60 of its human FRs, and has specificity for lysozyme be- 

polarity, charge distribution, and stereochemistry cause of its murine CDRs. 

whidh, in the aqueous environment where the cleavage A synthetic DNA was designed to facilitate CDR 

takes place, efficiently exposes the cleavage site to the insertions into a human heavy chain framework and to 

cleavage agent in preference to other potential cleavage facilitate empirical refinement of the resulting chimeric 

sites that may be present in the polypeptide, and/or to 65 amino acid sequence. This DNA is depicted in FIG. 5. 

improve the kinetics of the cleavage reaction, In spe- A synthetic, bifunctional protein was also designed at 

cific cases the amino acids of the hinge are selected and the DNA level, expressed, purified, rena tured, and 

assembled in sequence based on their known properties, shown to bind specifically with a preselected antigen 
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(digoxin). The detailed primary structure of this con- Lines 4 and 5 of FIG. 3 represent another construct, 

struct is shown in FIG. 6; its tertiary structure is illus- Line 4 is the full length \ H of the human antibody 

trated schematically in FIG. 2A. NEWM. That human antibody may be made specific 

Details of these experiments, and the design princi- for lysozyme by CDR replacement as shown in line 5. 

pies on which the invention is based, are set forth be- 5 Thus, for example, the segment TFTNYYIHWLK 

tow- from glp-4 replaces TFSNDYYTWVR of NEWM, and 

I. GENE DESIGN AND EXPRESSION ?* otber CDRs m rc P laced " shown - ^ r «« lls m a 

V/f comprising a human framework with mouse sequen- 

With the help of a computer program and known ccs determining specificity, 
variable region DNA sequences, synthetic and V* 10 By sequencing any antibody, or obtaining the se- 

genes may be designed which encode native or near qucnce {rom ^ literature, in view of this disclosure one 

native FR and CDR amino acid sequences from an ^,1^ m the art can produce a BABS of any desired 

antibody molecule, each separated by unique restriction tpecificity comprising any desired framework region, 

sites located as close to FR-CDR and CDR-FR borders Diagrams such as FIG. 3 comparing the amino acid 
as possible. Alternatively, genes may be designed which 15 sequence m valuable in suggesting which particular 

encode native FR sequences which arc sumlar or identi- aclds should ^ placed to defame the desired 

cal to the FRsof an antibody mol«u e from a selected co^pi^entority. Expressed sequences may be tested 

species, each separated by "dummy" CDR sequences for bindin ^j^y rcflncd by „ chang in g se- 

contaimng strategically located restriction sites. These lectcd uojno Kid * in relatively conserved regions, 

DNAs serve as starting materials for producmg BABS, 20 bascd on ottttaik k ^ acid sequence 

as the native or "dummy" CDR sequences may be ex- data ^ utcr modeling techniques, 

csed and replaced with sequences encoding the CDR significant flexibility in V* and V^d^sign is possible 

amino acids defining a selected bmd.ng site. Al ema- ^ t ad * ^ ^nnmedat the 

tively, one may design and direct y synthesize native or , , ~~ A „i " ■ , ^ r ™5 * - 

near-native FR sequences from a first antibody mole- 25 ^VlTd "J mMU P ulat,on of DNA B now » c " 

cule, and CDR sequences from 8 second antibody mole- co ^P 15 «■* v - 

cule. Any one of the V* and W L sequences described v * ' * C DNA "? USC n V ^ d 

above may be linked together directly, either via an Vl ►conteuung specific resmction sites flanking 

amino acids chain or linker connecting the C-termimis «** of ** *** CDRs was designed with the aid of a 

of one chain with the N-terminus of the other, or via 30 conra^r available computer program which pcr- 

C-terminal cysteine residues on each of the V* and V L . foms combined reverse translation and nstnction site 

These genes, once synthesized, may be cloned with C* V.exe by Compugene Inc.), The known 

or without additional DNA sequences coding for, e.g., anun0 ac,d i for V*«* v f 26 " ^polypeptides 

an antibody constant region, or a leader peptide which WCTC entered, and all potential DNA sequences which 

facilitates secretion or intracellular stability of a fusion 35 thosc P*pt»d« and all potential restriction sites 

polypeptide. The genes then can be expressed directly werc analyzed by the program. The program can, in 

in an appropriate host cell, or can be further engineered addition, select DNA sequences encoding the peptide 

before expression by the exchange of FR, CDR, or °nJy ** Q ™ preferred by R coli if this bacterium 

"dummy" CDR sequences with new sequences. This is to be host expression organism of choice. FIGS. 4A 

manipulation is facilitated by the presence of the restrict 40 and 4B show an example of program output, The nu- 

tion sites which have been engineered into the gene at c, «c acid sequences of the synthetic gene and the corre- 

the FR-CDR and CDR-FR borders. sponding amino acids are shown. Sites of restriction 

FIG. 3 illustrates the general approach to designing a cndonuclease cleavage are also indicated. The CDRs of 
chimeric further details of exemplary designs at the these synthetic genes are underlined. 
DNA level are shown in FIGS. 4A-4F. FIG. 3, lines I 45 The DNA sequences for the synthetic 26-10 V// and 
and 2, show the amino acid sequences of the heavy Vi are designed so that one or both of the restriction 
chain variable region of the murine monoclones glp-4 sites flanking each of the three CDRs are unique. A six 
(anti-lysozyme) and 26-10 (anti-digoxin), including the base site (such as that recognized by Bsm I or BsPM I) 
four FR and three CDR sequences of each. Line 3 is preferred, but where six base sites are not possible, 
shows the sequence of a chimeric Vjy which comprises 50 four or five base sites are used. These sites, if not already 
26-10 FRs and glp-4 CDRs: As illustrated, the hybrid unique, are rendered unique within the gene by elimi- 
protein of line 3 is identical to the native protein of line nating other occurrences within the gene without alter- 
2, except that 1) the se quenc e TFTNYYIHWLK has ing necessary amino acid sequences. Preferred cleavage 
replaced the sequence IFTDFYMNWVR, 2) EWIG- sites are those that, once cleaved, yield fragments with 
WIYPGNGNTKYNENFKG has replaced DYIGYIS- 55 sticky ends just outside of the boundary of the CDR 
PYSG VTG YNQKFKG , 3) RYTHYYF has replaced within the framework. However, such ideal sites are 
GSSGNK WAM, and 4) A has replaced V as the sixth only occasionally possible because the FR-CDR bound* 
amino acid beyond CDR-2. These changes have the try is not an absolute one, and because the amino acid 
effect of changing the specificity of the 26-10 V# to sequence of the FR may not permit a restriction site. In" 
mimic the specificity of glp-4. The Ala to Val single 60 these cases, flanking sites in the FR which are more 
amino acid replacement within the relatively conserved distant from the predicted boundary are selected, 
framework region of 26-10 is an example of the replace- FIG. 5 discloses the nucleotide and corresponding 
ment of an amino acid outside the hypervariable region amino acid sequence (shown in standard single letter 
made for the purpose of altering specificity by CDR code) of a synthetic DNA comprising a master frame- 
replacement. Beneath sequence 3 of FIG. 3, the restric* 65 work gene having the generic structure: 
tion sites in the DNA encoding the chimer V/y (see 

FIGS. 4A-4F) are shown which are disposed about the Ri-FRj-Xj-FR^Xj-FRj^Xa-nu-^ 
CDR-FR borders. 
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where R| and R 2 are blunt ends which are to be Hgated v P Yim fusion r>F P»nTPTN<! 

into a vector and X), X 2 , and X 3 are DNA sequences V ' hArKWWUH UF rKOTE1NS 

whose only function is to provide convenient restriction The engineered genes can be expressed in appropriate 

sites for CDR insertion. This particular DNA has prokaryotic hosts such as various strains of E. coli, and 
mouse FR sequences and unique, 6-base restriction sites 5 w eucaryotic hosts such as Chinese hamster ovary cell, 

adjacent the FR borders so that nucleotide sequences mouse myeloma, and human myeloma/transfectoma 

encoding CDRs from a desired monoclonal can be in- ce '^ s * 

serted easily. Restriction endonuclease digestion sites . For «™Pte. if"* g«* 15 «<> be expressed in E coli, 

are indicated with their abbreviations; enzymes of * m ^ *f cloned ,nt ? *? «pressron vector. This is 
choice for CDR replacement are underscored. Diges- 10 accomplished by posmoning the engineered gene 

tion of the gene with the following restriction endonu- downstream from a promoter sequence such as Trp or 

cleases results in 3' and 5' ends which can easily be T ac ' and t ■?* * p f? tldc Such » 

matched up with and ligated to native or synthetic B of protem A <FB). The resultmg expressed 

CDRs of desired specific^.- KPnl and BstXI are used „ ^l^S"^^ £ 
r t *• r /^Tvn VL i j i r sm-n j l* cytoplasm of the cells, and may be harvested after dis- 
for ligation of CDR^ Xbal and Oral for CDR 2 ; and of thc ^ by French ^ Qr ^ 

USSHU ana Uai ror L.L1K3. refractile bodies are solubilized, and the expressed pro- 

IL OLIGONUCLEOTIDE SYNTHESIS tein ? refolded and cleaved by the methods already es- 

tablished for many other recombinant proteins. 
The synthetic genes and DNA fragments designed as 20 If ±e engineered gene is to be expressed in myeloma 
described above preferably are produced by assembly ^ lhe conventional expression system for immuno- 
of chemically synthesized oligonucleotides. 15-100mer globulins, it is first inserted into an expression vector 
oligonucleotides may be synthesized on a Biosearch containing, for example, the Ig promoter, a secretion 
DNA Model 8600 Synthesizer, and purified by poly- signal* immunoglobulin enhancers, and various introns. 
acrylamide gel electrophoresis (PAGE) in Tris-Borate- 25 This plasmid may also contain sequences encoding all 
EDTA buffer (TBE). The DNA is then electrocuted 0 r part of a constant region, enabling an entire part of a 
from the gel. Overlapping oligomers may be phosphor- heavy or light chain to be expressed. The gene is trans* 
ylated by T4 polynucleotide kinase and ligated into fected into myeloma cells via established electropora- 
larger blocks which may also be purified by PAGE. tion or protoplast fusion methods. Cells so transfected 
tit ri nwiMr* QVWTWPTTr 30 e *P ress v ^ or V/f fragments, Vi-Vjy heterodiroers, 

heavy or light immunoglobulin chains, or portions 
The blocks or the pairs of longer oligonucleotides thereof, each of which may be attached in the various 
may be cloned into B coli using a suitable, e.g., pUC, . ways discussed above to a protein domain having an* 
cloning vector. Initially, this vector may be altered by 35 other function (e.g., cytotoxicity), 
single strand mutagenesis to eliminate residual six base Vectors containing a heavy chain V region (or V and 
altered sites. For example, V# may be synthesized and C regions) can be cotransfected with analogous vectors 
cloned into pUC as five primary blocks spanning the carrying a light chain V region (or V and C regions), 
following restriction sites: L EcoRI to first Narl site; 2. allowing for the expression of noncovalently associated 
first Narl to Xbal; 3. Xbal to Sail; 4. Sail to Ncol; 5. *™ <<* complete antibody molecules). 

Ncol to BamHI. These cloned fragments may then be CDR Exchange in a Synthetic Gene 

isolated and assembled in several three-fragment liga- 
tions and cloning steps into the pUC8 plasmid. Desired ™ e synthetic gene coding for mouse V# and Vx. 
ligations selected by PAGE are then transformed into, 2M° shown in FIGS- 4A and 4B were designed from 
for example, £ coli strain JM83, and plated onto LB 45 *f amno BC,d sequence of the protein with the 

Ampicillin +Xgal Plates according to standard proce- ™» of G™Wm a software program. These genes, 
dures. The gene sequence may be confirmed by super- P 0 *"* fo T ™™* 

coil sequencing after cloning, or after subcloning into ^° ntam Md often ^erestnetton sites 
M13^ the dideoxy method of Sanger. * ^ ***** «g« f d ^oes encoding CDR's to 

0 50 facilitate CDR replacement as noted above. 

IV. CDR EXCHANGE Both the 3' and 5' ends of the large synthetic oligo- 

m. win /~ u ... ^ 1 r~. ™.\ . mere were designed to include 6-base restriction sites, 

i2T ? v (o^ternatively, four FRs) can be re- . ^ ^ ^ ^ uc vector Furthennore , 

P ^ "J*.? TS e 1 th " T 5 aC " those restriction sites in the synthetic genes which were 

complied by cutting the shuttle pUC plasmid contain- 5J onJ suited for for ^ loning ^ pUC 

ing the respective genes at the two unique restriction wcre cxtcnded b « be , » donin sitcs ^£ etching 

sites flanking each CDR or FR, removing the excised ^les in pUC. 

sequence, and ligating the vector with a native nucleic Cloning of the synthetic DNA and later assembly.of 
acid sequence or a synthetic oligonucleotide encoding the g enc was facilitated by the spacing of unique restric- 
the desired CDR or FR. This three part procedure ^ tion sites along the gene. This allows corrections and 
would have to be repeated three times for total CDR modifications by cassette mutagenesis at any location, 
replacement and four times for total FR replacement. Among them are alterations near the 5' or 3' ends of the 
Alternatively, a synthetic nucleotide encoding two con- gene as needed for the adaptation to different expression 
secuttve CDRs separated by the appropriate FR can be vectors, For example, a PstI site is positioned near the 5' 
ligated to a pUC or other plasmid containing a gene 65 end of the Vj/gene. Synthetic linkers can be attached 
whose corresponding CDRs and FR have been cleaved easily between this site and a restriction site in the ex- 
out. This procedure reduces the number of steps re- pression plasmid. These genes were synthesized by 
quired to perform CDR and/or FR exchange. assembling oligonucleotides as described above using a 
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Biosearch Mode) 8600 DNA Synthesizer. They were These constructs also constitute master framework 
ligated to vector pUC8 for transformation of R colL genes, this time constructed of human framework se- 
Specific CDRs may be cleaved from the synthetic quences* They may be used to construct BABS of any 

Vjygene by digestion with the following pairs of restric- . desired specificity by appropriate CDR replacement, 

lion endonucleases: HpHl and BstXI for CDRi; Xbal 5 « 
and Dral for CDR 2 ; and Banll and BanI for CDRj. Synthesis of a Single Chain Fv 

After removal of one CDR, another CDR of desired A nucleic acid sequence encoding a composite Fv 
specificity may be ligated directly into the restricted region or single chain antibody binding site was de- 
gene, in its place if the 3' and 5' ends of the restricted signed with the aid of Compugene, a computer, program 
gene and the new CDR contain complementary single 10 as described above* This gene contains nucleic acid 
stranded DNA sequences. sequences encoding the V# and Vl regions of mouse 
In the present example, the three CDRs of each of 26-10 antibody linked together with a double-stranded 
mouse 26-10 and Vi 26-10 were replaccad with the synthetic oligonucleotide coding for a peptide with the 
corresponding CDRs of glp-4. The nucleic acid sequen- amino acid sequence (Gly Gly Gly Gly Ser)3 as shown 
ces and corresponding amino acid sequences of the 15 in FIG. 6. This linker oligonucleotide contains helper 
chimeric V# and Vi genes encoding the FRs of 26-10 cloning sites EcoRI and BamHI and was designed to . 
and CDRs of glp-4 are shown in FIGS. 4C and 4D. The contain the assembly sites Sad and Aatll near its 5' and 
positions of the restriction endonuclease cleavage sites 3' ends, respectively. These sites enable match-up and ' 
are noted with their standard abbreviations. CDR se- ligation to the 3' and 5' ends Vh of and Vj.26-10, rese- 
quences are underlined as are the restriction endonu- 20 tively t which also contain these sites (V/rinker-Vz.). 
cleases of choice useful for further CDR replacement. However, the order of linkage to the oligonucleotide 
These genes were cloned into pUC8. a shuttle plas- may be reversed (V/.-linker-Vtf). Other restriction sites 
mid. To retain unique restriction sites after cloning, the were designed into the gene to provide alternative as- 
Vtf-like gene was spliced into the EcoRI and Hindlll or scmbly sites. A sequence encoding the FB fragment of 
BamHI sites of the plasmid. . 25 protein A was used as a leader. 

Direct expression of the genes may be achieved in R The gene fragments were synthesized using a Bi- 

coiL Alternatively, the gene may be expressed in R coli ©search DNA Model 8600 Synthesizer as described 

as a fusion product by splicing it into the host gene above. Synthetic oligonucleotides were cloned accord- 

whose expression is regulated by interaction of a repres- ing to established protocol described above using the 

sor with the respective operator, The protein can be 30 pUC8 vector transfected into R coli The completed 

induced by starvation in minimal medium and by chemi- fused gene set forth in FIG, 6 was expressed in R Coli 
cal inducers. To date, the V# biosynthetic 26-10 gene After sonication, inclusion bodies were collected by 

has been expressed as such a fusion peptide behind the centrifugation, and dissolved in 6M guanidine hydro- 

Trp and Tac promoters. The gene translation product chloride (GuHCl), 0.2M Tris, and 0.1 M 2-mercaptoe- 

must then be cleaved from the fusion protein by e.g., 35 thanol (BME) PH 8.2. The protein was denatured and 

cyanogen bromide degradation, tryptic digestion, mild reduced in the solvent overnight at room temperature, 

acid cleavage, and/or digestion with factor Xa prote* Size exclusion chromatography was used to purify fu- 

ase. Therefore, a shuttle plasmid containing a synthetic sion protein from the inclusion bodies. A Sepharosc 4B 

gene encoding a leader peptide having a site for mild column (1.5x80 cm) was run in a solvent of 6M GuHCl 

acid cleavage, and into which has been spliced the syn- 40 and 0.01M NaOAc at pH 4.75. The protein solution was 

thetic V/f gene could be used for this purpose. In addi- applied to the column at room temperature in 0.5-1.0 ml 

tion, synthetic DNA sequences encoding a signal pep- amounts. Fractions were collected and precipitated 

tide for secretion of the fusion protein into the peri- with cold ethanol. These were run on SDS gels, and 

plasm of the host cell can also be incorporated into the fractions rich in the recombinant protein (approxi- 

plasmid. 45 mately 34,000d) were pooled. This offers a simple first 

After harvesting the gene product and optionally step for cleaning up inclusion body preparations with- 

releasing it from a fusion peptide, its activity as an anti- out suffering significant proteolytic degradation, 
body binding site and its specificity for glp-4 (lysozyme) For refolding, the protein was dialyzed against 100 

epitope are assayed by established immunological tech- ml of the same GuHCKTris-BME solution, and dialy- 

niques, e.g., radioimmunoassay. Correct folding of the 50 sate was diluted 1 1-fold over two days to 0.55M 

protein to yield the proper three-dimensional conforma- GuHCl, 0.02M Tris, and 0.01M BME. The dialysis 

tion of the antibody binding site is prerequisite for its sacks were then transferred to 0.0 1M NaCI, and the 

activity. This occurs spontaneously in a host such as a protein was dialyzed exhaustively before being assayed 

myeloma cell which naturally expresses immunoglobu- by RLVs for binding of 1-125 labeled digoxin. The re- 

lin proteins. Alternatively, for bacterial expression, the 55 folding procedure can be simplified by making a rapid 

protein forms inclusion bodies which, after harvesting, dilution with water to reduce the GuHCl concentration 

must be subjected to a specific sequence of solvent to MM, and then dialyzing against phosphate buffered 

conditions (e.g., diluted 20 X from 8M urea 0.01M Tris- saline (0.15M NaCI, 0.05M potassium phosphate, pH7, 

HC1 pH9 into 0.15M NaCI, 0.01M sodium phosphate, containing 0,03% Naty). so that it is free of any GuHCl 

PH 7.4 (Hochman et al., 1976 Biochem. 15:2706-2710) 60 within 12 hours. Product of both types o f preparation 

to assume its correct conformation and hence its active showed binding activity. 

form. All of the assays were conducted by a modification of 

FIGS. 4E and 4F show the DNA and amino acid the procedure of Mudgett-Hunter et al., (1982, J. 

sequence of chimeric V/f and Vt comprising human Immunol. 129:1165-1172; 1985, Molec. Immunol. 

FRs from NEWM and mouse CDRs from glp-4. The 65 22:477-488), so that they could be run on microtiter 

CDRs are underlined, as are restriction sites of choice plates as a solid phase sandwich assay. Binding data 

for further CDR replacement or empirically deter- were collected using goat anti-mouse Fab antisera 

mined refinement. (gAmFab) as the primary antibody that initially coats 
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the wells of the plate. These are polyclonal antisera 
which recognize epitopes that appear to reside mostly 
on mouse V/., The samples of interest are next added to 
the coated wells and incubated with the gAmFab, 
which binds species that exhibit appropriate antigenic 
sites. After washing away unbound protein, the wells 
are exposed to M25 labeled (radioiodinated) digoxin 
conjugates, either as I-125-dig-BSA or I-125-dig4ysine. 

The data are plotted in FIG. 7, which shows the 
results of a dilution curve experiment in which the par* 
ent 26*10 antibody was included as a control. The sites 
were probed with M25~dig-BSA in this assay. It was 
conducted as described above, with 8 series of dilutions 
prepared from initial stock solutions, including both the 
slowly refolded (1) and fast diluted/quickly refolded (2) 
single chain Fv proteins. The. parallelism between all 
three dilution curves indicates that gAmFab binding 
regions on the BABS molecule are essentially the same 
as on the Fv of authentic 26-10 antibody, i.e., the surface 
epitopes appear to be the same for both proteins. 

The sensitivity of these assays is such that binding 
affinity of the Fv for digoxin must be at least 10 6 . The 
parent 26-10 antibody has an affinity of 7X10 9 M- 1 . 
Inhibition assays indicate the binding of M25-digoxin- 
lysine may be as high as 10*» and can be inhibited by 
unlabeled digoxin, digoxigenin, digitoxin, digitoxigenin, 
gitoxin, acetyl strophanthidin, and ouabain in a way 
exactly parallel to the parent 26-10 Fab. This demon- 
strates that the specificity of the biosynthetic protein is 
substantially identical to the original monoclonal. 

The invention may be embodied in other specific 
forms without departing from the spirit and scope 
thereof. Accordingly, other embodiments are within 
the following claims. 

What is claimed is: 

1. A single polypeptide chain comprising: 

two polypeptide domains connected by a polypeptide 
linker spanning the distance between the C-ter- 
minus of one domain to the N-terminus of the other 
and defining a single and complete site for binding * 
a preselected antigen, wherein the amino acid se- 
quence of one of said polypeptide domains com- 
prises a heavy chain variable region, and the amino 
acid sequence of the other of said polypeptide do- 45 
mains comprises a light chain variable region, 
wherein at least one of said polypeptide domains 
comprises 

a recombinant polypeptide comprising: 

a set of CDR amino acid sequences together defining 50 
a recognition site for said preselected antigen, 

a set of FR amino acid sequences linked to said set of 
CDR sequences, 

said linked sets of CDR and FR amino acid sequences 
together defining a hybrid immunoglobulin vari 
able region binding domain which is immunologi 
cally reactive with said preselected antigen and 

a third amino acid sequence, peptide bonded to the N- 
or C- terminus of said site for binding, said third 
amino acid sequence comprising a single polypcp- 60 
tide chain having a conformation which confers 
biological activity to said third sequence under the 
same conditions that allow binding of said site for 
binding to said preselected antigen, said biological 
activity being independent of said site for binding. 65 

2. The polypeptide chain of claim 1 wherein said 
third amino acid sequence is peptide bonded to the 
N-tenninus of said site for binding. 
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3. The polypeptide chain of claim 1 wherein said 
third amino acid sequence is peptide bonded to the 
Oterminus of said site for binding. 

4. The polypeptide chain of claim 1 wherein said 
third amino acid sequence is a polypeptide immunologi- 
cally reactive with an antigen. 

5. The polypeptide chain of claim 1 wherein said 
third amino acid sequence is a polypeptide chain which 
binds to a site other than on said preselected antigen. 

6. The polypeptide chain of claim 1 wherein said 
third amino acid sequence is selected from the group 
consisting of toxins, sites for attachment to an immobili- 
zation matrix, and sites for attachment to a remotely 
detectable moiety. 

7. The polypeptide chain of claim 1 further compris- 
ing a radioactive atom bound to said polypeptide chain. 

8. A DNA encoding the polypeptide chain of claim 1. 

9. A single chain recombinant protein comprising a 
first polypeptide region peptide bonded to at least one 
other polypeptide region, 

one of said regions being immunologically reactive 
with a preselected antigenic site and comprising 
two polypeptide domains with the Oterminus of 
one peptide bonded to the N-terminus of the other 
through a polypeptide linker, each of said domains 
comprising a set of CDR amino acid sequences 
interposed between a set of FR amino acid sequen- 
ces, said sets of CDR and FR sequences in said 
domains together defining a binding site immuno- 
logically reactive with said preselected antigenic 
site, 

another of said polypeptide regions being peptide 
bonded to the N- or C-tcnninus of said immunolog- 
ically reactive binding site and comprising a single 
chain polypeptide having a conformation which 
confers biological activity to said another region 
under the same conditions that allow binding of 
said binding site region to said preselected anti- 
genic site, said biological activity being indepen- 
dent of said binding site region. 

10. A single chain recombinant binding protein com- 
prising at least two regions, 

one of said regions comprising two polypeptide do- 
mains connected by a polypeptide linker, the amino 
acid sequence of each of said polypeptide domains 
comprising a set of CDRs interposed between a set 
of FRs, said sets of CDRs and FRs together form- 
ing a single chain binding site immunologically 
reactive with a preselected antigenic site, 

said polypeptide linker comprising plural, peptide 
bonded amino acids defining a polypeptide which 
spans the distance between the Oterminus end of 
one of said domains and the N-tenninus and of the 
other of said domains when said binding site as- 
sumes a conformation suitable for binding, and 
comprising hydrophilic amino acids such that said 
linker assumes an unstructured polypeptide config- 
uration in aqueous solution, and 

another of said regions comprising an amino acid 
sequence, peptide bonded to the N- or C- terminus 
of said binding site, comprising a single chain poly* 
peptide having a conformation which confers bio- 
logical activity to said another region under the 
same conditions that allow binding of said binding 
sate to said preselected antigenic site, said biologi- 
cal activity being independent of said binding site 
region. 
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11. A single chain recombinant binding protein com- 
prising at least two regions, 

one of said regions comprising at least two polypep- 
tide domains connected by a polypeptide linker 
extending from the C-tenninus of one to the N-ter- 5 
minus of the other, the amino acid sequence of each 
of said polypeptide domains comprising a set of 
CDRs interposed between a set of FRs, said sets of 
CDRs and FRs together comprising a single chain 
binding site immunologically reactive with a prese- 10 
lected antigenic site, said binding site having a 
binding affinity of at least 10* liters/mole, and 

another of said regions comprising an amino acid 
sequence peptide bonded to the N- or C-tenninus 
of said binding site, comprising a single chain poly- 15 
peptide having a conformation which confers bio- 
logical activity to said another region under the 
same conditions that allow binding of said binding 
site region to said preselected antigenic site, said 
biological activity being independent of said bind* 20 
ing site region. 

12. A single chain recombinant binding protein com- 
prising at least two regions, 

one of said regions comprising at last two polypeptide 
domains connected by a polypeptide linker extend- 25 
ing form the C*tenninus of one to the N-tenninus 
of the other, the amino acid sequence of each of 
said polypeptide domains comprising a set of 
CDRs interposed between a set of FRs, said sets of 
CDRs and FRs together comprising a single chain 30 
binding site immunologically reactive with a prese- 
lected antigenic site* said binding site having a 
binding affinity of at least 10 6 liters/mole, 

said polypeptide linker comprising plural, peptide 
bonded amino acids defining a polypeptide which 35 
spans the distance between the C-tenninus end of 
one pf said domains and the N-terminus end of the 
other of said domains when said binding site as- 
sumes a conformation suitable for binding, and 

40 



comprising hydrophtlic amino acids such that said 
linker assumes an unstructured polypeptide config- 
uration in aqueous solution, and 
another of said regions comprising an amino acid 
sequence peptide bonded to the N- or C-tenninus 
of said binding site, comprising a single chain poly- 
peptide having a conformation which confers bio- 
logical activity to said another region under the 
same conditions that allow binding of said binding 
site region to said preselected antigenic site, said 
biological activity being independent of said bind- 
ing site region. 

13. The single chain protein of claim 9, 10, 11, or 12 
wherein said another polypeptide region comprises a set 
of CDR amino acid sequences interposed between a set 
of FR amino acid sequences, said sets of CDR and FR 
sequences together comprising a binding site immuno- 
logically reactive with an antigen. 

14. The single chain protein of claim 9, 10, 11, or 12 
wherein said another polypeptide region is selected 
from the group consisting of sites for attachment to a 
remotely detectable moiety; sites for attachment to an 
immobilization matrix; and toxins. 

15. The single chain protein of claim 9, 10, 11, or 12 
wherein said another polypeptide region comprises a 
toxin. 

16. Hie single chain protein of claim 9, 10, 11, or 12 
wherein said another region is bonded to the N-ter- 
minus of said binding site. 

17. The single chain protein of claim 9, 10, 11, or 12 
wherein said another region is a polypeptide chain 
which binds to a site other than said preselected anti- 
genic site. 

18. The single chain protein of claim 9, 10, 11 or 12 
further comprising a radioactive atom. 

19. A DNA encoding the single chain protein of 
claim 9, 10, 11, or 12. 
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